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Introduction and Dedication 



This book is dedicated to Paul Erdos, the greatest mathematician I 
have ever known, whom it has been my rare privilege to consider 
colleague, collaborator, and dear friend. 

I like to think that Erdos, whose mathematics embodied the princi- 
ples which have impressed themselves upon me as defining the true 
character of mathematics, would have appreciated this little book 
and heartily endorsed its philosophy. This book proffers the thesis 
that mathematics is actually an easy subject and many of the famous 
problems, even those in number theory itself, which have famously 
difficult solutions, can be resolved in simple and more direct terms. 

There is no doubt a certain presumptuousness in this claim. The 
great mathematicians of yesteryear, those working in number the- 
ory and related fields, did not necessarily strive to effect the simple 
solution. They may have felt that the status and importance of mathe- 
matics as an intellectual discipline entailed, perhaps indeed required, 
a weighty solution. Gauss was certainly a wordy master and Euler 
another. They belonged to a tradition that undoubtedly revered math- 
ematics, but as a discipline at some considerable remove from the 
commonplace. In keeping with a more democratic concept of intelli- 
gence itself, contemporary mathematics diverges from this somewhat 
elitist view. The simple approach implies a mathematics generally 
available even to those who have not been favored with the natural 
endowments, nor the careful cultivation of an Euler or Gauss. 



vii 




Introduction and Dedication 



viii 

Such an attitude might prove an effective antidote to a generally 
declining interest in pure mathematics. But it is not so much as incen- 
tive that we proffer what might best be called “the fun and games” 
approach to mathematics, but as a revelation of its true nature. The 
insistence on simplicity asserts a mathematics that is both “magi- 
cal” and coherent. The solution that strives to master these qualities 
restores to mathematics that element of adventure that has always 
supplied its peculiar excitement. That adventure is intrinsic to even 
the most elementary description of analytic number theory. 

The initial step in the investigation of a number theoretic item 
is the formulation of “the generating function”. This formulation 
inevitably moves us away from the designated subject to a consider- 
ation of complex variables. Having wandered away from our subject, 
it becomes necessary to effect a return. Toward this end “The Cauchy 
Integral” proves to be an indispensable tool. Yet it leads us, inevitably, 
further afield from all the intricacies of contour integration and they, 
in turn entail the familiar processes, the deformation and estimation 
of these contour integrals. 

Retracing our steps we find that we have gone from number theory 
to function theory, and back again. The journey seems circuitous, yet 
in its wake a pattern is revealed that implies a mathematics deeply 
inter-connected and cohesive. 




I 



The Idea of Analytic Number 
Theory 



The most intriguing thing about Analytic Number Theory (the use of 
Analysis, or function theory, in number theory) is its very existence! 
How could one use properties of continuous valued functions to de- 
termine properties of those most discrete items, the integers. Analytic 
functions? What has differentiability got to do with counting? The 
astonishment mounts further when we learn that the complex zeros 
of a certain analytic function are the basic tools in the investigation 
of the primes. 

The answer to all this bewilderment is given by the two words 
generating functions . Well, there are answers and answers. To those 
of us who have witnessed the use of generating functions this is a kind 
of answer, but to those of us who haven’t, this is simply a restatement 
of the question. Perhaps the best way to understand the use of the 
analytic method, or the use of generating functions, is to see it in 
action in a number of pertinent examples. So let us take a look at 
some of these. 



Addition Problems 

Questions about addition lend themselves very naturally to the use of 
generating functions. The link is the simple observation that adding 
m and n is isomorphic to multiplying z m and z n . Thereby questions 
about the addition of integers are transformed into questions about 
the multiplication of polynomials or power series. For example, La- 
grange’s beautiful theorem that every positive integer is the sum of 



1 
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four squares becomes the statement that all of the coefficients of the 
power series for 1 + z + z 4 + • • • + z are positive. How 
one proves such a fact about the coefficients of such a power series 
is another story, but at least one begins to see how this transition 
from integers to analytic functions takes place. But now let’s look at 
some addition problems that we can solve completely by the analytic 
method. 



Change Making 

How many ways can one make change of a dollar? The answer is 
293, but the problem is both too hard and too easy. Too hard because 
the available coins are so many and so diverse. Too easy because it 
concerns just one “changee,” a dollar. More fitting to our spirit is the 
following problem: How many ways can we make change for n if the 
coins are 1, 2, and 3? To form the appropriate generating function, 
let us write, for |z| < 1, 

A- = l + z + z' +1 + z 1 +'+' + ..., 

1 - Z 

C-T = 1 + + z 2 + 2+2 + ■ • ■ , 

1 - Z 2 

— L- = 1 + 7 3 + Z 3 + 3 + Z 3 + 3 + 3 + . . . , 

i - c 

and multiplying these three equations to get 
1 

(1 - z)(l - z 2 )(l - Z 3 ) 

= (1 + z + z 1+1 + • • 0(1 + Z 2 + Z 2+2 + • • •) 

X (1 + Z 3 + Z 3+3 + •••)• 

Now we ask ourselves: What happens when we multiply out the 
right-hand side? We obtain terms like z 1+ i+i+i . z 2 . A+-\ On the one 
hand, this term is z 12 , but, on the other hand, it is z fourl ' s + one2 + two3 ' s 
and doesn’t this exactly correspond to the method of changing the 
amount 12 into four l’s, one 2, and two 3’s? Yes, and in fact we 
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see that “every” way of making change (into l’s, 2’s, and 3’s) for 
“every” n will appear in this multiplying out. Thus if we call C in) the 
number of ways of making change for n, then C (n ) will be the exact 
coefficient of z n when the multiplication is effected. (Furthermore 
all is rigorous and not just formal, since we have restricted ourselves 
to |z| < 1 wherein convergence is absolute.) 

Thus 



J2 C <»)z" 



1 

(1 -z)( 1 -z 2 )(l -z 3 )’ 



( 1 ) 



and the generating function for our unknown quantity C(n) is 
produced. Our number theoretic problem has been translated into 
a problem about analytic functions, namely, finding the Taylor 
coefficients of the function - — w , D,, — j- . 

Fine. A well defined analytic problem, but how to solve it? We must 
resist the temptation to solve this problem by undoing the analysis 
which led to its formulation. Thus the thing not to do is expand yrr , 
Y 33 , respectively into z a > 12 z 2b 212 z 3c and multiply only to 
discover that the coefficient is the number of ways of making change 
for n . 

The correct answer, in this case, comes from an algebraic tech- 
nique that we all learned in calculus, namely partial fractions. Recall 
that this leads to terms like for which we know the expan- 

sion explicitly (namely, (1 _^_ )A . is just a constant times the (k — l)th 
derivative of ,, 1 . = V a n z n ). 

Carrying out the algebra, then, leads to the partial fractional 
decomposition which we may arrange in the following form: 

1 

(1 - z)(l - z 2 )( 1 - z 3 ) 

1 1 11 11 11 
6 (1 - z) 3 + 4 (1 - z) 2 + 4 (1 - z 2 ) + 3 (1 - z 3 ) ■ 



Thus, since 



1 

(1 - z ) 2 



d 1 
dz 1 - z 




+ l)z" 
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and 



1 

(1 - z ) 3 
C(n) 



d 1 d v •\ n + 1 „ 

dz 2(1 — z) 2 dz ^ 2 

i n + 2)(n + 1) „ 

2^ 2 Z ’ 

(n + 2)(n + T) n + 1 XiO) X2fi0 
12 + 4 + 4 + 3 



where Xi( n ) = 1 if 2 | n and = 0 otherwise; X 2 <fi) = 1 if 3 | n 
and = 0 else. A somewhat cumbersome formula, but one which can 
be shortened nicely into 



C(n) = 





(3) 



where the terms in the brackets mean the greatest integers. 

A nice crisp exact formula, but these are rare. Imagine the tness 
that occurs if the coins were the usual coins of the realm, namely 1,5, 
10, 25, 50, (100?). The right thing to ask for then is an “asymptotic” 
formula rather than an exact one. 

Recall that an asymptotic formula F{n) for a function fin) is one 
for which lim„^oo fFl — ] . j n the colorful language of E. Landau, 
the relative error in replacing f(n) by F(n) is eventually 0%. At 
any rate, we write fin) ~ F(n) when this occurs. One famous such 

example is Stirling’s formula n\ ~ -JItui ( - )". (Also note that our 

2 

result (3) can be weakened to C(n) ~ .) 

So let us assume quite generally that there are coins a.\, a 2 , a 3 , . . ., 
a k , where to avoid trivial congruence considerations we will require 
that there be no common divisiors other than 1 . In this generality we 
ask for an asymptotic formula for the corresponding C(n). As before 
we find that the generating function is given by 



C(n)z n 



1 

(1 — Z fll )(l — z ai ) ■ ■ ■ (1 — z ak ) 



(4) 



But the next step, explicitly finding the partial fractional decompo- 
sition of this function is the hopeless task. However, let us simply 
look for one of the terms in this expansion, the heaviest one. Thus 
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at z = 1 the denominator has a k-fold zero and so there will be a 
term . All the other zeros are roots of unity and, because we 
assumed no common divisiors, all will be of order lower than k. 

Thus, although the coefficient of the term {l ^ )k is c(" ' k k ( 1 ) , the 
coefficients of all other terms ,, - will be aco j ( n+J ,). Since all of 

(l-cozy \j— 1/ 

these j are less than k, the sum total of all of these terms is negligible 
compared to our heavy term , ')• In short C(n) ~ or 

even simpler, 

n k ~ l 

c(n) ~ c (Fr Tjy 

But, what is c? Although we have deftly avoided the necessity of 
finding all of the other terms, we cannot avoid this one (it’s the whole 
story!). So let us write 



1 



(1 — z ai )(l — z° 2 ) •••(!— z ak ) 



multiply by (1 — z) k to get 



+ other terms, 

(1 - z) k 



1 - z 1 - z 



1 — z a ‘ 1 — z ai 



1 - z 



1 — z ak 



= c + (1 - z) k X 



other terms, 



and finally let z — » 1 . By L’HopitaTs mle, for example, — > y 

whereas each of the other terms times (1 — z) k goes to 0. The final 
result is c = — - — , and our final asymptotic formula reads 

a\a 2 —a k ’ J r 



C(n ) 



n 



k - 1 



a { a 2 ■ ■ ■ au{k — 1)! 



( 5 ) 



Crazy Dice 

An ordinary pair of dice consist of two cubes each numbered 1 
through 6. When tossed together there are altogether 36 (equally 
likely) outcomes. Thus the sums go from 2 to 12 with varied 
numbers of repeats for these possibilities. In terms of our ana- 
lytic representation, each die is associated with the polynomial 
z + z 2 + z^ + z A + z 5 + z 6 ■ The combined possibilities for the 
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sums then are the terms of the product 

(z + z 2 + z 3 + z 4 + z 5 + z 6 )(z + z 2 + z 3 + z 4 + z 5 + z 6 ) 

= z 2 + 2z 3 + 3z 4 + 4z 5 + 5z 6 + 6z 7 
+ 5 z 8 + 4z 9 + 3z 10 + 2z u + z 12 

The correspondence, for example, says that there are 3 ways for the 
10 to show up, the coefficients of z 10 being 3, etc. The question is: Is 
there any other way to number these two cubes with positive integers 
so as to achieve the very same alternatives? 

Analytically, then, the question amounts to the existence of 
positive integers, a\, . . . , a 6 : b\, . . . , fo 6 , so that 

(z ai + • • • + z a6 )(z bl + • • • + z b 6 ) 

= z 2 + 2z 3 + 3z 4 + • • • + 3z 10 + 2z 11 + z 12 . 

These would be the “Crazy Dice” referred to in the title of this sec- 
tion. They look totally different from ordinary dice but they produce 
exactly the same results! 

So, repeating the question, can 

(z ai + • • • + z a6 )(z bl + • • • + z b 6 ) 

= (z + 2 2 + Z 3 + Z 4 + 2 5 + Z 6 ) (6) 

x (z + z 2 + z 3 + z 4 + z 5 + z 6 )? 

To analyze this possibility, let us factor completely (over the ratio- 
nals) this right-hand side. Thus z + z 2 + z 3 + z 4 + z 5 + z 6 = z = 
z(l +z + z 2 )(l +z 3 ) = z(l + z + z 2 )(l+z)(l-z + z 2 ).Weconclude 
from (6) that the “« -polynomial” and “h-polynomial” must consist of 
these factors. Also there are certain side restrictions. The a’s and /?’ s 
are to be positive and so a z-factor must appear in both polynomials. 
The a -polynomial must be 6 at z = 1 and so the (1 + z + z 2 ) (1 + z) 
factor must appear in it, and similarly in the /^-polynomial. All that 
is left to distribute are the two factors of 1 — z + z 2 . If one apiece are 
given to the a- and h-polynomials, then we get ordinary dice. The 
only thing left to try is putting both into the a -polynomial. 
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This works! We obtain finally 

J2 z a = z(l + z + z 2 )(l + z)(l - z + z 2 ) 2 
— z + z 3 + z 4 + z 5 + z 6 + z 8 

and 

^ z b — z(l + z + z 2 )(l + z) = z + 2z 2 + 2z 3 + z 4 . 

Translating back, the crazy dice are 1,3, 4, 5, 6, 8 and 1,2, 2, 3, 3, 4. 

Now we introduce the notion of the representation function . So, 
suppose there is a set A of nonnegative integers and that we wish to 
express the number of ways in which a given integer n can be written 
as the sum of two of them. The trouble is that we must decide on 
conventions. Does order count? Can the two summands be equal? 
Therefore we introduce three representation functions. 

r(n ) = #{(a, a') : a, a' £ A, n = a + a'}: 

So here order counts, and they can be equal; 

r + (n) = #{(«, a') : a, a' £ A, a < a', n — a + a'}, 

order doesn’t count, and they can be equal; 

r_(n) = #{(a, a') : a, a' £ A, a < a', n = a + a’}, 

order doesn’t count, and they can’t be equal. In terms of the generat- 
ing function for the set A, namely, A(z) = J2aeA z °’ we can express 
the generating functions of these representation functions. 

The simplest is that of r(n), where obviously 

^r(n)z n = A 2 (z). (7) 

To deal with r_(n), we must subtract A(z 2 ) from A 2 (z) to remove 
the case of a = a' and then divide by 2 to remove the order. So here 

J^r_(n)z n = l -[A 2 (z) - A(z 2 )]. 



( 8 ) 
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Finally for r + (n), we must add A(z 2 ) to this result to reinstate the 
case of a — a' , and we obtain 

^r+OOz" = ^[A 2 (z) + A(z 2 )]. (9) 

Can r(n ) be “constant?” 

Is it possible to design a nontrivial set A, so that, say, r + (n) is the same 
for all «? The answer is NO, for we would have to have 0 e A. And 
then 1 e A, else r+(l) ^ r + (0). And then 2 A, else r + (2) = 2. 
And then 3 e A, else r + (3) = 0 (whereas r+(l) = 1), then 4 ^ A, 
else r+(4) = 2. Continuing in this manner, we find 5 e A. But now 
we are stymied since now 6 = 1 + 5, 6 = 3 + 3, and r + (6) = 2. 

The suspicion arises, though, that this impossibility may just be 
a quirk of “small” numbers. Couldn’t A be designed so that, except 
for some misbehavior at the beginning, r + (n) = constant? 

We will analyze this question by using generating functions. So, 
using (9), the question reduces to whether there is an infinite set A 
for which 

^ [A 2 (z) + A(z 2 )] = P( Z ) + , (10) 

2 1 — z 

P(z) is a polynomial. 

Answer: No. Just look what happens if we let z — > (— 1) + . Clearly 
P{z) and y—_ remain bounded, A 2 (z) remains nonnegative, and 
A(z 2 ) goes to A(l) = oo, a contradiction. 



A Splitting Problem 

Can we split the nonnegative integers in two sets A and B so that 
every integer n is expressible in the same number of ways as the 
sum of two distinct members of A, as it is as the sum of two distinct 
members of B1 

If we experiment a bit, before we get down to business, and begin 
by placing 0 e A, then 1 e B, else 1 would be expressible as 
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a + a' but not as b + b' . Next 2 e B, else 2 would be a + a' but 
not b + b' . Next 3 e A, else 3 would not be a + a' whereas it 
is b + b' = 1+2. Continuing in this manner, we seem to force 
A — {0, 3, 5, 6, 9, • • •} and B = {1, 2, 4, 7, 8, • • •}. But the pattern 
is not clear, nor is the existence or uniqueness of the desired A, B. We 
must turn to generating functions. So observe that we are requiring 
by (8) that 

^[A 2 (z) - A(z 2 )] = \[B 2 {z) - B(z 2 )l (11) 

Also, because of the condition that A, B be a splitting of the 
nonnegatives, we also have the condition that 

A(z) + B(z) = — . (12) 

1 - z 

From (1 1) we obtain 

A 2 (z) - B 2 (z) = A(z 2 ) - B(z 2 ), (13) 

and so, by (12), we conclude that 

[A(z) - B(z )] • — = A(z 2 ) - B(z 2 ), 

1 - z 

or 



A(z) - B(z) = (1 - z)[A(z 2 ) - B(z 2 )]. (14) 

Now this is a relationship that can be iterated. We see that 
A(z 2 ) - B(z 2 ) = (1 - z 2 )[A(z 4 ) - B(z 4 )], 



so that continuing gives 

A(z) - B(z) = (1 - z)( 1 - z 2 )[A(z 4 ) - B(z 4 )]. 
And, if we continue to iterate, we obtain 



A(z) - B(z) = (1 - z)(l - z 2 ) • • • (1 - z 2 "") 



A(z r ) - B(r")] . 

(15) 




10 I. The Idea of Analytic Number Theory 

and so, by letting n — > oo, since A(0) = 1, B(0) = 0, we deduce 
that 

OO 

A(z) - B(z) = Y[(\ - z 2 ‘). (16) 

i= 0 

And this product is easy to “multiply out”. Every term z n occurs 
uniquely since every n is uniquely the sum of distinct powers of 2. 
Indeed z. n occurs with coefficient + 1 ifn is the sum of an even number 
of distinct powers of 2, and it has coefficient — 1, otherwise. 

We have achieved success! The sets A and B do exist, are unique, 
and indeed are given by A — Integers, which are the sum of an even 
number of distinct powers of 2, and B — Integers, which are the sum 
of an odd number of distinct powers of 2. This is not one of those 
problems where, after the answer is exposed, one proclaims, “oh, of 
course.” It isn’t really trivial, even in retrospect, why the A and B 
have the same r_(n), or for that matter, to what this common r_ (n) 
is equal. (See below where it is proved that r _(2. lk+l — 1) = 0.) 

A = Integers with an even number of l’s in radix 2. Then and 
only then 



2k+\ 




is not the sum of two distinct A’s. 

Proof. A sum of two A’s, with no carries has an even number of 

odd 

1 ’s (so it won’t give 1 1 1 • • • 1), else look at the first carry. This gives 
a 0 digit so, again, it’s not 1 1 • • • 1 . 

So r_( 2 2k+l — 1) = 0. We must now show that all other n have 
a representation as the sum of two numbers whose numbers of 1 
digits are of like parity. First of all if n contains 2k 1 ’s then it is the 
sum of the first k and the second k. Secondly if n contains 2k + 1 
1 ’s but also a 0 digit then it is structured as 1 1 1 • • • o A where A 

m 

contains 2k + 1 — m 1 ’s and, say, is of total length L then it can be 
expressed as 1 1 1 • • • 1 o 00 • • • 00 plus 1 A and these two numbers 



m— 1 



2 




An Identity of Euler’s 



II 



have respectively m l’s and 2k + 2 — m l’s. These are again of like 
parity so we are done. 



An Identity of Euler’s 

Consider expressing n as the sum of distinct positive integers, i.e., 
where repeats are not allowed. (So For n = 6, we have the expression 
1+2 + 3 and also 2 + 4, 1 + 5, and just plain 6 alone.) 

Also consider expressing n as the sum of positive odd numbers, 
but this time where repeats are allowed. (So for n — 6, we get 1 + 5, 
3 + 3, 1 + 1 + 1 + 3, l + l + l + l + l + l.) In both cases we 
obtained four expressions for 6, and a theorem of Euler’s says that 
this is no coincidence, that is, it says the following: 

Theorem. The number of ways of expressing n as the sum of distinct 
positive in tegers equals the number of ways of expressing n as the 
sum of (not necessarily distinct) odd positive in tegers. 



To prove this theorem we produce two generating functions. The 
latter is exactly the “coin changing” function where the coins have 
the denominations 1, 3, 5, 7, ... . This generating function is given 
by 

1 

• (17) 

(1 - z)(l - z 3 )(l - z 5 ) • • • 

The other generating function is not of the coin changing variety 
because of the distinctness condition. A moment’s thought, however, 
shows that this generating function is given as the product of 1 + z, 
1 + z 2 , 1 + z 3 , .... For, when these are multiplied out, each z k factor 
occurs at most once. In short, the other generating function is 

(1 + z)(l + Z 2 )(1 + z 3 )---. (18) 



Euler’s theorem in its analytic form is then just the identity 



1 

(1 - z)(l - z 3 ) (1 - z 5 ) • • • 

throughout |z | < 1. 



(1 + z) ( 1 +Z 2 )(1 + z 3 )--- 

(19) 
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Another way of writing (19) is 

(1 - z)( 1 - z 3 )(l - z 5 ) • • • (1 + z)(l + z 2 )(l + z 3 ) • • • = 1 (20) 

which is the provocative assertion that, when this product is 
multiplied out, all of the terms (aside from the 1) cancel each other! 

To prove (2) multiply the 1 — z by the 1 + z (to get 1 — z 2 ) and 
do the same with 1 — z 3 by 1 + z 3 , etc. This gives the new factors 
1 — z 1 , 1 — z 6 , 1 — z 10 , • • • and leaves untouched the old factors 
1 + z 2 , 1 + z 4 , 1 + z 6 , ■ ■ ■■ These rearrangements are justified by 
absolute convergence, and so we see that the product in (20), call it 
P(z), is equal to 

(1 - z 2 )(l - z 6 )(l - z 10 ) •••(! + z 2 )(l + z 4 ) • • • 

which just happens to be P(z 2 )! So P(z) = P(z 2 ) which of course 
means that there can’t be any terms az k , a ^ 0, k ^ 0, in the 
expansion of P(z), i.e., P(z) is just its constant term 1, as asserted. 



Marks on a Ruler 

Suppose that a 6” ruler is marked as usual at 0, 1, 2, 3, 4, 5, 6. 
Using this ruler we may of course measure any integral length from 
1 through 6. But we don’t need all of these markings to accomplish 
these measurements. Thus we can remove the 2, 3, and 5, and the 
marks at 0, 1, 4, 6 are sufficient. (The 2 can be measured between 4 
and 6, the 3 can be gotten between 1 and 4, and the 5 between 1 and 
6.) Since ( 4 ) = 6, this is a “perfect” situation. The question suggests 
itself then, are there any larger perfect values? In short, can there 
be integers a\ < a 2 < ■ ■ ■ < a n such that the differences a, — aj, 
i > j, take on all the values 1, 2, 3, ... , (")? 

If we introduce the usual generating function A(z) = YH= l z “ ! ’ 
then the differences are exposed, not when we square A(z), but when 
we multiply A(z) by A(-). Thus A(z) ■ A(-) — Yll j = i z a ‘~ aj and 
if we split this (double) sum as i > j,i — j, and i < j, we obtain 

z ai ~ aj +n + 

i>j *<j 
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Our “perfect ruler,” by hypothesis, then requires that the first sum be 
equal to J2k=\ z k , N = ("), and since the last sum is the same as 
first, with 7 replacing z, our equation takes the simple form 



aw - a G) = £/ + "-‘' w= G)' 



k=—N 

or, summing this geometric series, 

( 1 \ z N+l - z~ N 

Afe) ' A U ) = -rn- 



+ n - 1, N — 



n 



( 21 ) 



w 



In search of a contradiction, we let z lie on the unit circle z ■ 
so that the left side of (21) becomes simply \A(e' e )\ 2 , whereas the 
right-hand side is 



z N+\ _ z -(N+\) 

7 ~ 

Z 2 — Z 2 

and ( 21 ) reduces to 



+ n — 1 = 



sin(A + 2)0 

t — b n — 1 

sin 7 0 



A(e w ) 



sin ^±^0 
sin \9 



+ n — 1 . 



(22) 



A contradiction will occur, then, if we pick a 0 which makes 



sin ’^^0 
sin 7 0 



< ~(n - 1 ). 



(23) 



(And we had better assume that n > 5, since we saw the perfect 
ruler for n = 4.) 

A good choice, then, is to make sin ,r ~” +1 0 = —1, for exam- 
ple by picking 0 = . In that case sin f < f , — ^ > | , 

— — V < — | = — 2 ”~ 3 ^ ,i+2 . and so the requirement (23) follows 

from — 2,r ~ 2 ” +2 < —{n— 1) or 2 n 2 — 2 n + 2 > 2>7t{n — 1). But 
2n 2 — 2n + 2 — 37r(n — 1) > 2n 2 — 2n + 2 — 10(n — 1) = 
2 {n — 3) 2 — 6 > 2 • 2 2 — 6 = 2, for n > 5. There are no perfect 
rulers! 
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Dissection into Arithmetic Progressions 

It is easy enough to split the nonnegative integers into arithmetic 
progressions. For example they split into the evens and the odds or 
into the progressions In, 4n + 1, An + 3. Indeed there are many 
other ways, but all seem to require at least two of the progressions 
to have same common difference (the evens and odds both have 2 as 
a common difference and the An + 1 and An + 3 both have 4). So 
the question arises Can the positive integers be split into at least two 
arithmetic progressions any two of which have a distinct common 
difference? 

Of course we look to generating functions for the answer. The 
progression an + b, n — 0, 1 , 2, . . . will be associated with the 
function z an+h . Thus the dissection into evens and odds cor- 
responds to the identity J2T=o z> ' = z 2n + J2T=o z 2n+ ' > and 

the dissection into 2n, An + 1, An + 3 corresponds to z n = 
z 2n + J2T=o z 4n+l + J2T=o z 4n+ 3 > etc. Since each of these series 
is geometric, we can express their sums by z.“ n+h — j—j, . Our 
question then is exactly whether there can be an identity 

1 z bl z b2 z bk 

— + + • • • + , 

1 — z 1 — z ai 1 — z“ 2 1 — z° k 

1 < a\ < «2 < . . . < a\. (24) 

Well, just as the experiment suggested, there cannot be such a dis- 
section, (24) is impossible. To see that (24) does, indeed, lead to a 

2 7ti 

contradiction, all we need do is let z —> e “ k and observe that then 
all of the terms in (24) approach finite limits except the last term 
-j-yk- which approaches oo. 

Hopefully, then, this chapter has helped take the sting out of the 
preposterous notion of using analysis in number theory. 
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Problems for Chapter I 

1. Produce a set A such that r(n ) > 0 for all n in 1 < n < N, but 
with | A | < *J4N + 1. 

2. Show that every set satisfying the conditions of (1) must have 
| A | < <JN. 

3 . Show directly, with no knowledge of Stirling’s formula, that n ! > 

( n\n 

^ e ' 




II 

The Partition Function 



One of the simplest, most natural, questions one can ask in arithmetic 
is how to determine the number of ways of breaking up a given inte- 
ger. That is, we ask about a positive integer n : In how many ways can 
it be written as a + b + c + ■■ ■ where a, b, c, ... are positive inte- 
gers? It turns out that there are two distinct questions here, depending 
on whether we elect to count the order of the summands. If we do 
choose to let the order count, then the problem becomes too simple. 
The answer is just 2” _1 and the proof is just induction. Things are 
incredibly different and more complicated if order is not counted! 

In this case the number of breakups or “partitions” is 1 for n — 1, 
2 for n — 2, 3 for n — 3,5 for n — 4, 7 for n — 5, e.g., 5 has the 
representations l + l + l + l + l, 2+1 + 1 + 1,3 + 1 + 1,4+1, 
5, 3+2, 2 + 2 + 1, and no others. Remember such expressions 
asl + 1+ 2+1 are not considered different. The table can be 
extended further of course but no apparent pattern emerges. There 
is a famous story concerning the search for some kind of pattern in 
this table. This is told of Major MacMahon who kept a list of these 
partition numbers arranged one under another up into the hundreds. 
It suddenly occurred to him that, viewed from a distance, the outline 
of the digits seemed to form a parabola! Thus the number of digits 
in pin), the number of partitions of n, is around C^fn, or pin) itself 
is very roughly e a ^ . The first crude assessment of p(n)\ 

Among other things, however, this does tell us not to expect any 
simple answers. Indeed later research showed that the true asymptotic 

formula for pin) is e -^^~ , certainly not a formula to be guessed! 
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Now we turn to the analytic number theory derivation of this 
asymptotic formula. 



The Generating Function 

To put into sharp focus the fact that order does not count, we may 
view p(n) as the number of representations of n as a sum of 1 ’s and 
2’s and 3’s . . . , etc. But this is just the “change making” problem 
where coins come in all denominations. The analysis in that problem 
extends verbatim to this one, even though we now have an infinite 
number of coins, So we obtain 

OO OO | 

= n t— r at 

77=0 k= 1 4 

valid for |z | < 1, where we understand that p(0) — 1. 

Having thus obtained the generating function, we turn to the sec- 
ond stage of attack, investigating the function. This is always the 
tricky (creative?) part of the process. We know pretty well what kind 
of information we desire about p(n): an estimate of its growth, per- 
haps even an asymptotic formula if we are lucky. But we don’t know 
exactly how this translates to the generating function. To grasp the 
connection between the generating function and its coefficients, then, 
seems to be the paramount step. How does one go from one to the 
other? Mainly how does one go from a function to its coefficients? 

It is here that complex numbers really play their most important 
role. The point is that there are formulas (for said coefficients). Thus 
we learned in calculus that, if f(z ) = a nZ n , then a n = J ! , 

expressing the desired coefficients in terms of high derivatives of the 
function. But this a terrible way of getting at the thing. Except for 
rare “made up” examples there is very little hope of obtaining the nth 
derivative of a given function and even estimating these derivatives 
is not a task with very good prospects. Face it, the calculus approach 
is a flop. 

Cauchy’s theorem gives a different and more promising approach. 
Thus, again with f(z ) = a nZ n , this time we have the formula 
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a n — ^-r f c z^jdz, an integral rather than a differential operator! 
Surely this is a more secure approach, because integral operators are 
bounded, and differential operators are not. The price we pay is that 
of passing to the complex numbers for our z’s. Not a bad price, is it? 

So let us get under way, but armed with the knowledge that the 
valuable information about f{z) will help in getting a good approx- 
imation to f c dz. But a glance at the potentially explosive 
shows us that C had better stay as far away from the origin as it can, 
i.e., it must hug the unit circle. Again, a look at our generating func- 
tion P( n )z n shows that it’s biggest when z is positive (since the 
coefficients are themselves positive). All in all, we see that we should 
seek approximations to our generating function which are good for 
|z| near 1 with special importance attached to those z’s which are 
near + 1 . 



The Approximation 



Starting with (1), F(z) — n^li phr > and taking logarithms, we 
obtain 



log F(z ) = 



OO OO ^kj 



OO 1 OO OO 

k= 1 4 k= 1 y'=l J 

OO 1 OO OO 1 /' 

E ’ E*'‘ = E ’ A- 

^ 7 ft " / 1 - z J 



7=1 J k = 1 



“ J 



(2) 



Now write z — e w so that > 0 and obtain log F(e w ) — 
J2T= i i • Thus noticing that the expansion of begins with 
2 — f + c\x + • • • or equivalently (near 0) 2 — ?-d- + cx + ■ ■ 
we rewrite this as 



log F(<T”) = £ i 






g—kw 



e kw — 1 kw 



+ 



2 



( 3 ) 
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= t + -2 108(1 - ^ 

1 / 1 1 e~ kw \ 

+ ^ k y e kw - 1 ku> + ~T ) ' 

The form of this series is very suggestive. Indeed we recognize any 
series \ A(kw) — Y1 A )ai° w as a Riemann sum, approximating 
the Riemann integral / 0 °° — { p-dt for small positive w. It should come 
as no surprise then, that such series are estimated rather accurately. 
So let us review the “Riemann sum story”. 



Riemann Sums 

Suppose that 0(x) is a positive decreasing function on (0, oo) and 
that h > 0. The Riemann sum YlkLi <P(kh)h is clearly equal to the 
area of the union of rectangles and so is bounded by the area under 
y — 0(x). Hence i <P(kh)h < / 0 °° <p{x)dx. On the other hand, 
the series Ylh= o 4>(kh)h can be construed as the area of this union of 
these rectangles and, as such, exceeds the area under y — 0(x). So 
this time we obtain Y1T=o 4>(kh )h > / 0 °° <p(x)dx. 

Combining these two inequalities tells us that the Riemann sum 
lies within h ■ <p (0) of the Riemann integral. This is all very nice and 
rather accurate but it refers only to decreasing functions. However, we 
may easily remedy this restriction by subtracting two such functions. 
Thereby we obtain 

oo r oc 

y^l [<j>(kh) - 1 J/(kh)]h - / [0(x) - t/t (jc)] < /*[</>( 0) + ^(0)]. 

Calling 4>{x) — iff (x) = F(x) and then observing that 0(0) + i/^(0) 
is the total variation V of F(x) we have the rather general result 

oo r oc 

V F(kh)h - F{x) « h ■ V(F). (4) 

k= i 

To be sure, we have proven this result only for real functions but 
in fact it follows for complex ones, by merely applying it to the real 
and imaginary parts. 
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To modify this result to fit our situation, let us write w — he' 9 , 
h > 0, — 7t/2 < 6 < 7r/2, and conclude from (4) that 

oo r oc 

V F(khe w )h - / F(xe w )dx « h ■ V e (F ) 

J° 

(Vg is the variation along the ray of argument 0), so that 

oo r oc 

V F(kw)w - / F{xe ie )d{xe w ) « w ■ V e (F). 

k~t Jo 

Furthermore, in our case of an analytic F , this integral is actually 
independent of 9. (Simply apply Cauchy’s theorem and observe that 
at oo F falls off like ). We also may use the formula V H (F) — 
/ 0 °° F'(xe lfi )\dx and finally deduce that 

00 pOO pOO 

y^F(kw)w — / F(x)dx w I \F'(xe ,e )\ dx. 

Jo Jo 



Later on we show that 



n 



1 1 e x \ dx 1 

“x + yj7 = og V57’ 



(5) 



and right now we may note that the (complicated) function 
F\xe w ) = 2 ^ e 



x 3 e 3w 



2x 2 e 2W 2xe 



iQ 



x 2 e 2 ' 9 (e xe ' e — 1) xe i9 (e xe ' 6 — l) 2 

is uniformly bounded by —jjy in any wedge \0\ < c < 7t/2(m + 
M(c)), so that we obtain 



OO 



E 



1 

k 



( 



1 



e kw _ l 



1 

kw 



g—kw 

~ Y ~ 



— log - Mw 

V2tt 



(6) 



throughout | arg w\ < c < .t/2. 
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The Approximation. We have prepared the way for the useful ap- 
proximation to our generating function. All we need to do is combine 
(1), (3), and (6), replace w by log - , and exponentiate. The result is 



n 



1 



1 



— z 



k 




in 



|1 -zl 
1 - \z\ 



< c. 



[l + 0(l-z)] 



But we perform one more “neatening” operation. Thus log - is 
an eyesore! It isn’t at all analytic in the unit disc, we must replace 
it (before anything good can result). So note that, near 1, log 7 = 

(l-z)+^ + ^+ -- - = 2{^ + 0((1 - z) 3 ), or ^ = 

| jz! + 0(1 — z )■ Finally then, 



OO 1 

n 

k=\ 



1 ex P ( TT ) [! + 0(1 ~ *)] ( 7 ) 



in 



2n 

11 ~ z\ 

1 - Izl 



12 1 



< c. 



This is our basic approximation. It is good near z = 1, which 
we have decided is the most important locale. Flere we see that 
we can replace our generating function by the elementary function 

y yy exp ( Jr ) w h° se coefficients should then prove amenable. 

Flowever, (7) is really of no use away from z = 1, and, since 
Cauchy’s theorem requires values of z all along a closed loop sur- 
rounding 0, we see that something else must be supplied. Indeed we 
will show that, away from 1, everything is negligible by comparison. 
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To see this, let us return to (2) and conclude that 

, , 1 v' 1 1 ^’ 1 V' 1 1 

bg^)- — g 77 



j~2 j 1 - W 1 
- l) 

i - ki V 6 r 



F(z ) <<C exp 



iiV t- • 



an estimate which is just what we need. It shows that, away from 1, 

where 77^— r is smaller than — , F(z) is rather small. 

|l-z| l-|zl ' 

Thus, for example, we obtain 

F(z) « exp — — — - when > 3. (9) 

1 - z\ 1 - \z\ 



Also, in this same region, setting 



1 - z 



n- 1 + z 



l^ exp 12 — 






/ 2 ( 7T 2 2 \ ( 7Z 2 2 

12— <<exp (i2 3UHijj 



so that 



0(z) <3C exp 



1 - |z| 



when — > 3. (11) 

1 - Izl " 



The Cauchy Integral. Armed with these preparations and the 
feeling that the coefficients of the elementary function <fi ( z ) are acces- 
sible, we launch our major Cauchy integral attack. So, to commence 
the firing, we write 

Ptn)- q (n)=— j, — dz 



( 12 ) 
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and we try C a circle near the unit circle, i.e., 

C is |z| — r, r < 1. (13) 

Next we break up C as dictated by our consideration of jqq , 
namely, into 

|1 - z\ 



A is the arc |z| 



r, 



1 



< 3, 



and 



(14) 



B is the arc |z| = r, - — > 3. 



So, 



Pin) - q {n) 



-[ 

\7U J A 



Fiz) - </>iz) 



_ . , . , ■ dz + _ 

2m J A z n+i 2ni j B z 



1 - Izl 



- f 

■zri J B 



(15) 



Fiz) - <Kz) 



n + 1 



dz, 



and if we use (7) on this first integral and (9), (11) on this second 
integral we derive the following estimates: 

f F(z) - Hz) 

2 71 i J A 



7 n + 1 



dz 



« 



M' 

yYl-\-\ 



(1 - r) 3/2 exp ( — 



Tt~ 



1 



6 1 — r 



x the length of A . 



(M' is the implied constant in the O of (7) when c — 3). 

As for the length of A, elementary geometry gives the formula 



4 r arcsin 



V2(l - r) 



and this is easily seen to be (9(1 — r). We finally obtain, then, 

1 f F(z) - Hz) , 

2 Tti J A z” +l 

' ' 2 1 



(1 - r) 5 / 2 . / tc 

M exp 



6 1 



(16) 



where M is an absolute constant. 
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For the second integral, 
F(z)-<Kz ) 



2 ni J B 



7 ^ 1+1 



dz < 



1 



2jzr n+l 



2 exp 



1 - r 



■ 27 rr 



2 / 1 
= 37 ex P 



1 — r 

And this is even smaller than our previous estimate. So combining 
the two gives, by (15), 



(1 — r) 5 / 2 in 2 1 
pin) - qiri) <SC M exp — 



6 1 — r 



(17) 



But what is r? Answer: anything we please (as long as 0 < r < 
1)! We are masters of the choice, and so we attempt to minimize 
the right-hand side. The exact minimum is too complicated but the 
approximate one occurs when n exp ( ^ 7^7 ) is minimized and 

this occurs when ^ +- = n( 1 — r), i.e., r = 1 — -§=. So we 
choose this r and, by so doing, we obtain, from (17), the bound 



p(n ) = <?(«) + <9 ( n-s/V+'+j . 



(18) 



The Coefficients of ^(n) 



The elementary function 0(z) has a rather pleasant definite integral 
representation which will then lead to a handy expression for the 
qin). 

If we simply begin with the well-known identity 




e 1 dt — s/tt 



and make a linear change of variables ( a > 0), 




a 
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or 



Thus if we set b 2 



L 



e ~a 2 t 2 e +2abt 



dt = 



— oo 

7T 2 1 



6 — and a~ = 1 — z (thinking of z as real 
( | z | < 1) for now), we obtain 



/; 



gU 1 e +n 






Vi 



l n 2 1 
exp 



6 1 — z 



which gives, finally, 



0(z) 



y—7T 2 /l2 



(1 -Z) 



/: 



nV2 

Equating coefficients therefore results in 






q(n) 



9 — tc 2 /\2 po o 



7T 



V2 J- 



L 



t 



2n 



t 



2n—2 



n\ (n — 1)! 



( 19 ) 



r Vv 3t-r- dt ( 20 ) 



the “formula” for q(n) from which we can obtain asymptotics. 

Reasoning that the maximum of the integrand occurs near t = «Jn 
we change variables by t = s + JTi , and thereby obtain 



where 



q(n) = C n 




K n ( s)2se 




r 

n 



K„(s ) = 



g 71 2 



izVln e n n\ 



1 + Vr 

2y/n 

( i + V l 



i + 





( 21 ) 



Since K„(s ) — » 1, we see, at least formally, that the above integral 
approaches 





~du, 
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where we have set s = 

odd, it is equal to 
becomes 



2 

: -£= + . Furthermore, since ue~ ir is 

fZc e ~ u du — . Thus (21) formally 



q(n) 



V2rmn n 
4V3 n e n n\ 



( 22 ) 



And score another one for Stirling’s formula, which in turn 
gives 



q(n) 



e Xy/2nl 3 

4V3 n 



(23) 



and our earlier estimate (18) allows us thereby to conclude that 



p(n) 




(24) 



Success l We have determined the asymptotic formula for p(n)\ 
Well, almost. We still have two debts outstanding. We must justify 
our formal passage to the limit in (21), and we must also prove our 
evaluation (5). So first we observe that xe~ x is maximized at x = 1, 
so we deduce that 



( 1 + —p= ) e < 1 

V sfn) 

(using jf = (1 + 4^)) and also 



(25) 



1 + 




e < e 2n 



(using jc = (1 + ^) 2 ). 

Thus using (25) for positive s, by (21), 

K n (s) < e s ~ for s > 0, 



(26) 




